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Background — While age is a common confounder, its impact on 
health-related quality of life (HRQoL) after total hip replacement 
is uncertain. This could be due to improper statistical modeling of 
age in previous studies, such as treating age as a linear variable 
or by using age categories. We hypothesized that there is a non- 
linear association between age and HRQoL. 

Methods — We selected a nationwide cohort from the Swed- 
ish Hip Arthroplasty Register of patients operated with total hip 
replacements due to primary osteoarthritis between 2008 and 
2010. For estimating HRQoL, we used the generic health out- 
come questionnaire EQ-5D of the EuroQol group that consits or 2 
parts: the EQ-5D index and the EQ VAS estimates. 

Using linear regression, we modeled the EQ-5D index and the 
EQ VAS against age 1 year after surgery. Instead of using a straight 
line for age, we applied a method called restricted cubic splines 
that allows the line to bend in a controlled manner. Confounding 
was controlled by adjusting for preoperative HRQoL, sex, previ- 
ous contralateral hip surgery, pain, and Charnley classification. 

Results — Complete data on 27,245 patients were available for 
analysis. Both the EQ-5D index and EQ VAS showed a non-linear 
relationship with age. They were fairly unaffected by age until the 
patients were in their late sixties, after which age had a negative 
effect. 

Interpretation — There is a non-linear relationship between 
age and HRQoL, with improvement decreasing in the elderly. 



Although age is an important confounder in most health- 
related outcomes (Harrell 2001), there is no consensus regard- 
ing its impact on health-related quality of life (HRQoL) after 
total hip replacement (THR) (Ethgen et al. 2004). Some stud- 
ies have reported decreasing improvement in HRQoL after 
THR with age (McGuigan et al. 1995, Birdsall et al. 1999, 



Santaguida et al. 2008, Rolfson et al. 2011, Keurentjes et al. 
2013), while others have indicated little, if any, effect (Ris- 
sanen et al. 1996, Jones et al. 2001, Lawless et al. 2012). Fur- 
thermore, whether the effect of age is similar throughout life 
or whether there is an accelerated decline in the elderly has not 
been properly investigated. 

Our lack of understanding of this matter may be due to 
improper methodology when modeling age in statistical 
analysis. The review by Ethgen et al. (2004) involving factors 
affecting HRQoL after knee and hip replacements revealed 
8 approaches to modeling age. Most approaches have cate- 
gorized age, while some have treated it as a linear variable. 
Although a common approach, categorization of a continuous 
variable has major disadvantages and can, under certain cir- 
cumstances, be statistically equivalent to losing one-third of a 
study population (Lagakos 1988). It also assumes that every- 
one within the category has the same HRQoL estimate, thus 
creating an unrealistic scenario. For example, if the research- 
ers decide to use one cut point at retirement age, 65 years, then 
a 64-year-old appears in the model to have more in common 
with a 40-year-old than with a 65-year-old. 

On the other hand, the linear approach assumes that the esti- 
mate follows a straight line, i.e. the change between 55 and 
56 years of age is the same as between 85 and 86 years of 
age. This is not in harmony with everyday clinical perception, 
where the majority of the population is fairly healthy until the 
decade between 60 and 70 years of age, after which there is a 
decline. Changing the linear approach to a method where the 
line is allowed to bend, known as splines or fractioned poly- 
nomials, has for a long time been advocated as a more sound 
approach (Brenner and Blettner 1997). 

We investigated the effect of age on HRQoL after total hip 
arthroplasty by modeling age through splines. We hypothesized 
that age would have a non-linear impact on HRQoL after THR. 



Open Access - This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, 
distribution, and reproduction in any medium, provided the source is credited. 
DOI 10.3109/17453674.2014.916492 



Acta Orthopaedica 2014; 85 (3): 244-249 



245 



Methods 

This was a nationwide prospective cohort study based on the 
Swedish Hip Arthroplasty Registry (SHAR). Since its incep- 
tion in 1979, the SHAR has collected data on all primary and 
secondary THRs performed in Sweden (Karrholm 2010). A 
program for gathering patient-reported outcome measures 
(PROMs) was adopted in 2002 and reached full nationwide 
coverage in 2008. Each hospital is responsible for data collec- 
tion and for uploading the data to an online database. 

Ethics 

The study was conducted in accordance with the Helsinki 
Declaration. The regional ethics committee in Gothenburg, 
Sweden, approved the study (Dnr 380-13). 

Participants 

We included patients who were at least 40 years of age and 
who had THR between January 1, 2008 and December 31, 
2010 for primary osteoarthritis. We excluded those with reop- 
erations and those who had died within 1 .5 years of the index 
operation. If a patient had 2 records due to bilateral surgery, 
only the first hip with complete data was selected. Follow-up 
data were gathered until December 31, 201 1. 

During this period, 38,350 THRs in 35,592 patients with 
primary osteoarthritis who were at least 40 years old were 
performed in Sweden. 34,519 patients (97%) were included in 
the study. The response rate for patients who had answered the 
preoperative questionnaire was 92% (27,382 patients), while 
3,175 patients (68%) who had not answered the preoperative 
form filled out the form 1 year later. 

Variables 

The primary outcomes were the EQ-5D index and EQ VAS 1 
year after surgery. Both measures are derived from the generic 
health outcome questionnaire EQ-5D of the EuroQol group 
(EuroQol Group 1990). The EQ-5D questionnaire consists of 
6 items: 5 questions and the EQ VAS. The questions span 5 
dimensions of HRQoL: (1) mobility, (2) self-care, (3) usual 
activities, (4) pain/discomfort, and (5) anxiety/depression. 
Each dimension has 3 levels of severity, generating a total of 
243 combinations representing different health states. There 
are different value sets that may be used to translate these 
health states into a utility index. We used the Swedish experi- 
ence-based time-trade-off (TTO) value set that translates the 
answers into a score between 0.34 and 0.97, on a scale where 
0 represents death and 0.97 maximum attainable HRQoL by 
the EQ-5D measurement. The EQ VAS, in turn, consists of a 
visual analog scale ranging from 0 to 100 where the patients 
are asked to mark their present health state; 100 corresponds 
to the best health state imaginable and 0 the worst health state 
imaginable. 

The primary exposure was age at the date of surgery. We 
chose sex, patient-reported Charnley class, previous contra- 



lateral THR, and preoperative pain (from a patient-reported 
VAS) as possible confounders. The Charnley classification 
organizes patients into 3 classes: (A) unilateral hip disease, 
(B) bilateral hip disease, or (C) multiple joint disease or other 
disabilities impairing walking ability (Charnley 1972). The 
preoperative values of the EQ-5D index and the EQ VAS were 
modeled against their outcomes in order to obtain the change 
in HRQoL independent of the starting point. 

Statistics 

We used ordinary least-squares linear regression for estima- 
tion of the coefficients. Both age and the preoperative EQ-5D 
index were modeled by restricted cubic splines, also known as 
natural splines. This is a type of spline that uses cubic terms in 
the center of the data and restricts the ends to a straight line, 
preventing the center from distorting the ends. The flexibility 
of a spline is chosen by the number of "knots": more knots 
allow a more detailed description of the relationship. Essen- 
tially, each knot connects 2 sections of the line; for instance, 
a linear spline with 1 knot will have a V-shape, while 2 knots 
allow for an N-shaped relationship. As sharp twists of a linear 
spline are undesirable, the cubic function is applied to smooth 
the curve over the different knots. The position of the knots 
was chosen from different quantiles, depending on the number 
of knots. To avoid over-fitting the spline, the number of knots 
was chosen by the Bayesian information criterion (BIC). 
The BIC is more conservative than the more commonly used 
Akaike information criterion, and was chosen because of the 
large sample size. The restricted cubic spline was chosen in 
order to avoid distortions at the extremes, although alternative 
non-linear models have been investigated (see Supplementary 
data). 

All analyses were performed using R software (ver. 3.0.2) 
(Team 2013), and packages rms (ver. 4.1-0) (Harrell 2013) 
for analysis, Gmisc for table output (ver. 0.6.1.1), ggplot2 for 
graphics (ver. 0.9.3.1) (Wickham 2009), and knitr (ver 1.5) 
(Xie 2013) for reproducible research. Due to non-normally 
distributed outcomes with heteroscedasticity, the confidence 
intervals were calculated using a robust covariance matrix 
(HC3) through the sandwich package (ver 2.3-0) (Burstrom 
et al. 2006). 



Results 

There were no major differences between those with full data 
and those with missing data, although the large sample size 
indicated statistical significance for age, preoperative EQ-5D 
index, and preoperative EQ VAS (Table 1). 

Age 

The majority of patients improved across the age range for 
both outcomes (Table 2). In general, almost twice as many 
failed to improve according to the EQ VAS score compared 
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Table 1 . Basic characteristics of the study population, 
showing the absolute numbers (%) for proportions and 
mean (SD) for continuous variables 



V CI I ICIUIC 


r^omnlptp 
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(n = 27,245) 


MiQ^inn 

i vi tool i ly 

(n = 7,274) 


Female 


15,619 (57%) 


4,118(57%) 


Age 


69 (10) 


69 (11) 


Pain VAS, mm 


62 (16) 


64(16) 


Charnley class 






A 


12,697 (47%) 


1,059 (41%) 


B 


3,341 (12%) 


333 (13%) 


C 


11,207 (41%) 


1,186 (46%) 


EQ-5D 






Preoperatively 


0.73 (0.11) 


0.71 (0.12) 


Postoperatively 


0.88 (0.11) 


0.87 (0.12) 


EQ VAS 






Preoperatively 


54 (22) 


51 (22) 


Postoperatively 


76 (20) 


74 (21) 



to the EQ-5D index (16% vs. 9.4%). This failure to improve 
increased in patients over 80 years of age; 20% of these elderly 
patients failed to improve in EQ VAS score and 13% failed 
to improve in EQ-5D index. However, a minority of patients 
were over 80 years of age (14%). 

The regression model confirmed that age was associated 
with a decrease in the 2 HRQoL outcomes; on average, a 
40-year-old patient compared to an 80-year-old patient had a 
0.030 higher gain in EQ-5D index (95% CI: 0.023-0.037) and 
a 5.4 higher gain in EQ VAS (95% CI: 4.2-6.6). Furthermore, 
the age-related decrease was non-linear for both measure- 
ments, with a decrease that started in the patient's late six- 
ties (Figure 1). The dimensions that corresponded closest to 
the overall spline were mobility and usual activities (data not 
shown). We found no support for interaction between age and 
gender for either outcome. 

EQ-5D and EQ VAS 

The preoperative EQ-5D index and EQ VAS explained sub- 
stantially more than age; 3.9 times for the EQ-5D index and 
1.9 times for the EQ VAS when comparing the impact of the 



variables on the model R-square value. Patients with low pre- 
operative values had the highest gain, although they did not 
reach the same absolute levels (Figure 2). With increasing pre- 
operative values, the improvement lessens and patients with 
preoperative values close to the ceiling actually declined on 
average 1 year later. 

Charnley class explained the largest proportion of the vari- 
ance in the model, as measured by the R-squared: 30% for the 
EQ-5D index and 36% for the EQ VAS. Female gender, previ- 
ous contralateral THR, and Charnley class C had the poorest 
outcomes, while male gender, no contralateral THR, and class 
A had the best outcomes. The figures illustrated this by polar- 
izing the predicted results for these 2 groups (the blue and 
pink lines in Figures 1 and 2). Age did not reach the same 
impact, even at the age extremes for the EQ-5D index, i.e. 
compared 90-year-old male patients in class A, 40-year-old 
female patients in class C had poorer EQ-5D index values 
(-0.012, 95% CI: -0.020 to -0.004). For the EQ VAS out- 
come, age had a stronger affect; female patients at 70 years of 
age in class C performed similarly to male patients at 90 years 
of age in class A (-0.6, 95% CI: -1.8 to 0.6). 



Discussion 

We found a non-linear correlation between age and HRQoL 
one year after surgery. Both scores remained largely unaf- 
fected by age until a downturn in the patient's late sixties. Fur- 
thermore, we also found that the preoperative EQ-5D index 
and EQ VAS values were non-linearly correlated to the post- 
operative values 1 year later. It is worth noting that increased 
age was related to a lower preoperative HRQoL. Since lower 
preoperative HRQoL corresponded to a greater improvement, 
in the majority of the elderly HRQoL increased one year after 
surgery. 

Since the mean age at surgery in this cohort was 69 years, 
one could argue that surgery is performed too late in order to 
maximize HRQoL outcome. On the other hand, the decline 
in improvement may be due to natural age-related deteriora- 
tion of HRQoL that is not hip-related; therefore, it could be 
expected that the intervention would have a smaller effect on 



Table 2. Raw comparison of the median (interquartile range) pre/ postoperative EQ-5D index and EQ VAS for various age groups 







EQ-5D (IQR) 




EQ VAS (IQR) 




Age, years 


n 


Preoperative 


Postoperative 


Change 


Preoperative 


Postoperative 


Change 


<50 


1,216 


0.71 (0.61-0.81) 


0.93 (0.87-0.97) 


0.19(0.10-0.27) 


50 (32-70) 


85 (70-90) 


25 (10-45) 


51-60 


4,726 


0.71 (0.61-0.78) 


0.93 (0.83-0.97) 


0.17 (0.10-0.26) 


50 (35-70) 


80 (70-91) 


25 (8-44) 


61-70 


12,005 


0.74 (0.64-0.81) 


0.93 (0.84-0.97) 


0.16 (0.07-0.25) 


52 (38-71) 


80 (70-92) 


20 (5-40) 


71-80 


1 1 ,788 


0.77 (0.67-0.84) 


0.91 (0.81-0.97) 


0.12 (0.06-0.20) 


54 (40-70) 


80 (61-90) 


20 (2-38) 


81-85 


3,356 


0.74 (0.64-0.81) 


0.87 (0.77-0.97) 


0.10 (0.03-0.20) 


50 (40-70) 


75 (55-89) 


15 (0-35) 


>85 


1,428 


0.71 (0.63-0.80) 


0.87 (0.77-0.93) 


0.13 (0.03-0.22) 


50 (36-70) 


70 (50-85) 


20 (0-35) 
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EQ-5D index 1 year after surgery 



EQ VAS 1 year after surgery 
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Figure 1 . The relationship between the EQ-5D index and the EQ VAS one year postoperatively 
and the patient's age at surgery. Preoperative EQ-5D index and EQ VAS were set to the most 
frequently occurring values (index = 0.87; VAS = 50) and are indicated by the horizontal dashed 
lines. The change before and after surgery is the height above this line, i.e. anything above is an 
improvement. The 2 lines differ only in height; the solid line with blue confidence interval indicates 
the optimal combination of covariates (male sex, first hip, and Charnley class A) while the dotted 
line with pink confidence interval indicates the least favorable combination (female sex, previous 
contralateral hip surgery, and Charnley class C). The pain VAS was set to the median value, 65 
mm. 



EQ-5D index 1 year after surgery 



0.6 - 



0.4 - 



EQ VAS 1 year after surgery 
100 
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First THR 
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Previous contralateral THR 



Figure 2. The relationship between the EQ-5D index and the EQ VAS one year after surgery in 
relation to preoperative values. The 2 lines differ only in height. The solid line with blue confidence 
interval indicates the optimal combination of covariates (male sex, first hip, and Charnley class 
A) while the dotted line with pink confidence interval indicates the least favorable combination 
(female sex, second hip, and Charnley class C). Age and pain were set to the median values, 69 
years and 65 mm. 



the gain in HRQoL in the elderly. HRQoL is also just one out 
of many possible PROM outcomes after THR, and while it is 
an interesting metric, it should be combined with other mea- 
sures during patient consultation (Rolf son et al. 201 1). 

Our results are in line with those from other large cohort 
studies. Pennington et al. (2013) reported on 30,203 patients in 
whom age was correlated with a decrease in the EQ-5D index. 



Judge et al. (2013) showed that when 
categorizing age into < 50, 50 to 60, 
and > 60 years in 1,375 THR patients, 
both the younger and the older catego- 
ries performed worse. They concluded 
that age had a non-linear effect, but 
failed to explore the relationship fur- 
ther. Their findings were also for a hip- 
specific outcome, the Oxford hip score. 
Cushnaghan et al. (2007) showed in 
a small sample of 278 patients with 
THR that the SF-36 score was poorer 
in patients over 67 years of age when 
surveyed 8 years after surgery. It is not 
surprising that many studies have failed 
to detect the correlation for HRQoL 
outcomes (Jones et al. 2001, Ng et al. 
2007, Quintana et al. 2009), as the age 
variable only explained a minor pro- 
portion of the variation. Our data sug- 
gest that adjustment for age above 70 
years will mitigate confounding. While 
not recommended, ignoring age up to 
70 years could be a viable alternative in 
those situations where splines are dif- 
ficult to implement. 

In addition to the non-linearity for 
age, both the preoperative EQ-5D 
index and the EQ-VAS exhibited non- 
linearity. Using the difference between 
the preoperative and the postoperative 
values, as many previous studies have 
done (Rolfson et al. 2011, Grosse Frie 
et al. 2012), will lead to residual con- 
founding as it assumes both linearity 
and that the preoperative HRQoL esti- 
mate equals 1 . Furthermore, the figures 
demonstrate the instrument's ceiling 
effect. When a patient begins with a 
high HRQoL close to the ceiling, he/ 
she will deteriorate on average — as 
any improvement cannot be measured 
beyond the instrument's ceiling. This 
can also be viewed as a regression to 
the mean where extreme observations 
tend to move toward the population 
mean. 

The main strength of our study was the nationwide popula- 
tion, and it should therefore be representative of the average 
patient. The small difference between respondents and non- 
respondents at the 1-year follow-up also support our belief 
that the results can be generalized. 

Another important strength is that we used splines instead of 
categorization. Categorizing continuous variables introduces 
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a cut-point bias (Royston et al. 2006, Williams 2011). Identi- 
fication of the best divisor between groups by optimizing cut 
points can increase the risk of type-I errors. For instance, a 
study that reports a p-value of 0.05 and that has applied opti- 
mal cut points should in reality be reporting a p-value > 0.25 
(Miller and Siegmund 1982, Altman et al. 1994, Hilsenbeck 
and Clark 1996, Lausen and Schumacher 1996). If standard cut 
points or quantiles are used instead of optimal cut points, this 
usually results in a loss of power. For instance, dichotomizing 
a normally distributed continuous variable can be equivalent 
to reducing the study population by a third (Lagakos 1988). 
While these properties are well known among statisticians, we 
have found no clinical articles that have implemented splines 
to model age for HRQoL outcomes. 

Our study was limited by the small amount of data per 
patient. Expanding the routine follow-up program with more 
questions to provide more detailed background information 
may enhance the regression model, but it will jeopardize 
response rates, thus having a negative effect on the generaliz- 
ability. The follow-up was also only 1 year after surgery; how- 
ever, previous studies have shown surprisingly little change 
beyond 1 year (Judge et al. 2013). 

It is important to remember that we looked at the chronolog- 
ical age, not the biological age. Use of a biological marker for 
age may improve the accuracy of the model, as the life span of 
humans is heterogeneous (Simm et al. 2008). This is outside 
the scope of a registry study but may be plausible in a smaller 
study. Similarly, if a more detailed co-morbidity adjustment 
than the Charnley classification was applied, the effect of age 
might be further attenuated and even more difficult to detect. 

We conclude that there is a non-linear relationship between 
age and HRQoL in patients receiving THR that results in resid- 
ual confounding if treated as a simple linear term or categori- 
cally in the regression. The implication of this is important, 
as age is a common confounder for which adjustment is nec- 
essary. Furthermore, we conclude that the non-linearity also 
applies to preoperative EQ-5D index and EQ VAS. Adjusting 
for these as linear variables or using the difference between 
the preoperative and postoperative values will result in residual 
confounding to an even greater extent than that for age. 

Supplementary data 

Appendix is available at Acta's website (www.actaorthop. 
org), identification number 6947. 
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